SEQUENCE LISTING

<110> SANTI, Daniel
      PECK, Lawrence
      KEALEY, James
      DAYEM, Linda

<120> Heterologous Production of Polyketides 



<130> 010025.01 

<140> 09/699,136 
<141> 2000-10-27 

<150> 60/161,703 
<151> 1999-10-27 

<160> 6 

<170> FastSEQ for Windows Version 4.0

<210> 1 
<211> 1917 
<212> DNA 

<213> Propionibacterium freudenreichii ssp. shermanii
<400> 1 

atgagcagca cggatcaggg gaccaacccc gccgacactg acgacctcac tcccaccaca 60 

ctcagcctgg ccggggattt ccccaaggcc actgaggagc agtgggagcg cgaagttgag 120 

aaggtactca accgtggtcg tccaccggag aagcagttga ccttcgccga gtgtctgaag 180 

cgcctgacgg ttcacaccgt cgatggcatc gacatcgtgc cgatgtaccg tccgaaggac 240 

gccccgaaga agctgggtta ccccggcgtc gcacctttca cccgcggcac cacggtgcgc 300 

aacggcgaca tggatgcctg ggacgtgcgc gccctgcacg aggatcccga cgagaagttc 360 

acccgcaagg cgatcctcga aggcctggag cgtggcgtca cctccctgtt gctgcgcgtt 420 

gatcccgacg cgatcgcacc cgagcacctc gacgaggtcc tctccgacgt cctgctggaa 480 

atgaccaagg tggaggtctt cagccgctac gaccagggtg ccgccgccga ggccctggtg 540 

agcgtctacg agcgctccga caagccggcg aaggacctgg ccctcaacct gggcctggat 600 

cccatcgcgt tcgcagccct gcagggcacc gagccggatc tgaccgtgct cggtgactgg 660 

gtgcgccgcc tggcgaagtt ctcgccggac tcgcgcgccg tcacgatcga cgcgaacatc 720 

taccacaacg ccggtgccgg cgacgtggca gagctcgcct gggcactggc caccggcgcg 780 

gagtacgtgc gcgccctggt cgagcagggc ttcaccgcca ccgaggcctt cgacacgatc 840 

aacttccgtg tcaccgccac ccacgaccag ttcctcacga tcgcccgtct tcgcgccctg 900 

cgcgaggcat gggcccgcat cggcgaggtc ttcggcgtgg acgaggacaa gcgcggcgcc 960 

cgccagaatg cgatcaccag ctggcgtgac gtcacgcgcg aagaccccta tgtcaacatc 1020 

cttcgcggtt cgattgccac cttctccgcc tccgttggtg gggccgagtc gatcacgacg 1080 

ctgcccttca cccaggccct cggcctgccg gaggacgact tcccgctgcg catcgcgcgc 1140 

aacacgggca tcgtgctcgc cgaagaggtg aacatcggcc gcgtcaacga cccggccggt 1200 

ggctcctact acgtcgagtc gctcacccgc agcctggccg acgccgcctg gaaggaattc 1260 

caggaggtcg agaagctcgg tggcatgtcg aaggccgtca tgaccgagca cgtcaccaag 1320 

gtgctcgacg cctgcaatgc cgagcgcgcc aagcgcctgg ccaaccgcaa gcagccgatc 1380 

accgcggtca gcgagttccc gatgatcggg gcccgcagca tcgagaccaa gccgttcccc 1440 

gccgctccgg cgcgcaaggg cctggcctgg catcgcgact ccgaggtgtt cgagcagctg 1500 

atggatcgct ccaccagcgt ctccgagcgc cccaaggtgt tcctggcctg cttgggcacc 1560 

cgtcgcgact tcggtggccg cgagggcttc tcgagcccgg tgtggcacat cgccggcatc 1620 

gacaccccgc aggtcgaagg cggcaccacc gccgagatcg tcgaggcatt caagaagtcg 1680 

ggcgcccagg tggccgacct ctgctcgtcc gccaaggtct acgcgcagca gggacttgag 1740 

gtcgccaagg cactcaaggc cgccggcgca aaggccctgt acctgtcggg cgccttcaag 1800 

gagttcggtg atgacgccgc cgaggccgag aagctgatcg acggacgcct gtttatgggc 1860 

atggatgtcg tcgacaccct gtcctccacc cttgatatct tgggagtcgc gaagtga 1917 

<210> 2 

<211> 2187 
<212> DNA 

<213> Propionibacterium freudenreichii ssp. shermanii



<400> 2 

gtgagcactc tgccccgttt tgattcagtt gacctcggca atgccccggt tcctgctgat 60
gccgcacgac gcttcgagga actggccgcc aaggccggca ccggagaggc gtgggagacg 120
gccgagcaga ttccggttgg caccctgttc aacgaagacg tctacaagga catggactgg 180
ctggacacct acgcaggtat cccgccgttc gtccacggcc cgtatgcaac catgtacgcg 240
ttccgtccct ggacgattcg ccagtacgcc ggtttctcca cggccaagga gtcgaacgcc 300
ttctaccgcc gcaaccttgc ggccggccag aagggcctgt cggttgcctt cgacctgccc 360
acccaccgtg gctacgactc ggacaatccc cgcgtcgccg gtgacgtcgg catggccggt 420
gtggccatcg actccatcta tgacatgcgc gagctgttcg ccggcattcc gctggaccag 480
atgagcgtgt ccatgaccat gaacggcgcc gtgctgccga tcctggccct ctatgtggtg 540
accgccgagg agcagggcgt caagcccgag cagctcgccg ggacgatcca gaacgacatc 600
ctcaaggagt tcatggttcg taacacctac atctacccgc cgcagccgag tatgcgaatc 660
atctctgaga tcttcgccta cacgagtgcc aatatgccga agtggaattc gatttccatt 720
tccggctacc acatgcagga agccggcgcc acggccgaca tcgagatggc ctataccctg 780
gccgacggtg ttgactacat ccgcgccggc gagtcggtgg gcctcaatgt cgaccagttc 840
gcgccgcgtc tgtccttctt ctggggcatc ggcatgaact tcttcatgga ggttgccaag 900
ctgcgtgccg cgcgcatgtt gtgggccaag ctggtgcatc agttcgggcc gaagaacccg 960
aagtcgatga gcctgcgcac ccactcgcag acctccggtt ggtcgctgac cgcccaggac 1020
gtctacaaca acgtcgtgcg tacctgcatc gaggccatgg ccgccaccca gggccatacc 1080
cagtcgctgc acacgaactc gctcgacgag gccatcgccc tgccgaccga tttcagcgcc 1140
cgcatcgccc gtaacaccca gctgttcctg cagcaggaat cgggcacgac gcgcgtgatc 1200
gacccgtgga gcggctcggc atacgtcgag gagctcacct gggacctggc ccgcaaggca 1260
tggggtcaca tccaggaggt cgagaaggtc ggcggcatgg ccaaggccat cgaaaagggc 1320
atccccaaga tgcgcatcga ggaagccgcc gcccgcaccc aggcacgcat cgactccggc 1380
cgccagccgc tgatcggcgt gaacaagtac cgcctggagc acgagccgcc gctcgatgtg 1440
ctcaaggtgg acaactccac ggtgctcgcc gagcagaagg ccaagctggt caagctgcgc 1500
gccgagcgcg atcccgagaa ggtcaaggcc gccctcgaca agatcacctg ggccgccggc 1560
aaccccgacg acaaggatcc ggatcgcaac ctgctgaagc tgtgcatcga cgctggccgc 1620
gccatggcga cggtcggcga gatgagcgac gcgctcgaga aggtcttcgg acgctacacc 1680
gcccagattc gcaccatctc cggtgtgtac tcgaaggaag tgaagaacac gcctgaggtt 1740
gaggaagcac gcgagctcgt tgaggaattc gagcaggccg agggccgtcg tcctcgcatc 1800
ctgctggcca agatgggcca ggacggtcac gaccgtggcc agaaggtcat cgccaccgcc 1860
tatgccgacc tcggtttcga cgtcgacgtg ggcccgctgt tccagacccc ggaggagacc 1920
gcacgtcagg ccgtcgaggc cgatgtgcac gtggtgggcg tttcgtcgct cgccggcggg 1980
catctgacgc tggttccggc cctgcgcaag gagctggaca agctcggacg tcccgacatc 2040
ctcatcaccg tgggcggcgt gatccctgag caggacttcg acgagctgcg taaggacggc 2100
gccgtggaga tctacacccc cggcaccgtc attccggagt cggcgatctc gctggtcaag 2160
aaactgcggg cttcgctcga tgcctag 2187



<210> 3 
<211> 1851 
<212> DNA 

<213> Streptomyces cinnamonensis



<400> 3 

atgacggtcc tgcctgacga cgggctttcc ctggccgccg agttccctga tgcgacccat 60
gagcagtggc accgccttgt ggaaggcgtc gtgcgcaagt cgggtaagga cgtatcgggg 120
acggccgcag aggaagcgct gtccaccacc ctcgaggacg ggctcaccac ccgccccctg 180
tacaccgcgc gcgacgccgc gcccgacgcc ggtttccccg gcttcgcccc tttcgtcagg 240
ggttccgtcc ccgagggcaa caccccgggc ggctgggacg tccggcagcg gtacgcgagc 300
gccgatcccg cgcgcaccaa tgaagcggtc ctcaccgatc tggagaacgg cgtcacctcg 360
ctctggctga cgctgggctc cgccggtctt ccggtcaccg gtctggagcg tgcgctcgac 420
ggcgtctacc tcgacctggt gcccgtcgcg ctcgacgcgg gcagcgaggc cgcgaccgcc 480
gcccgggagt tgctgcgcct gtacgaggcc gcgggcgtcg ccgacgacgc ggtgcgcggc 540
acgctcggcg ccgacccgct cggccacgag gcccgcaccg gggagaagag cacctccttc 600
gcggcggtcg ccgaactggc ccggctgtgc ggggagcgct accccggtct gcgcgcgctg 660
accgtggacg cgctgccgta ccacgaggcg ggtgcctccg ccgcgcagga gctcggcgcc 720
tcgctcgcca ccggtgtcga gtatctgcgc gccctgcacg acaagggtct cggtgtcgag 780
aaggccttcg cgcagctgga gttcaggttc gcggccaccg ccgaccagtt cctcaccatc 840
gccaagctgc gcgccgcgcg ccgcctctgg gcgcgtgtcg ccgaggtgtc cggggtgccg 900
gccgccgggg cgcagcggca gcacgcggtg acctcgccgg tgatgatgac ccgccgcgac 960
ccgtgggtga acatgctgcg gaccaccgtg gcgtgcctcg gcgcgggtgt gggcggtgcc 1020
gacgccgtca cggtcctgcc gttcgaccac gagctcgggc tgccggacgc gttcgcgcgc 1080
cgcatcgccc gcaacacgtc gacgatcctc ctcgaggagt cgcacctcgc gcgtgtcatc 1140
gacccggcgg gcggctcctg gtacgtggag cggctcaccg atgaactcgc ccacgcggcc 1200
tgggacttct tcaaggagat cgagcgcgcg gacggtcagg tcgcggcgct ccgctccggc 1260
ctggtcggcg accggatcgc cgcgacctgg gccgagcgca ggaagaagct ggcccggcgc 1320
cgcgaaccga tcacgggtgt cagcgagttc ccgctgctca ccgagcgccc ggtcgagcgc 1380
gagcccgcgc ccgccgcccc gcccggcggt ctgccccggg tgcgccgcga cgaggcgtac 1440
gaggagctgc gcggccgctc ggacgcgcac ctggaagcga ccggcgcccg cccgaaggtg 1500
ttcatcgccg cgctggggcc ggccgccgcg cacacggcgc gcgcgacgtt cgccgccaac 1560
ctcttcatgg cgggcggcgt cgagccggtg cacgacccgg tgtcggtgga cgcggagacg 1620
gccgccgagg cgttcgcggc gagcggcgcc accgtcgcgt gcctctgctc cagcgacgtg 1680
ctctacgccg agcaggcgga agcggtcgcg cgggccctga agtcggcggg cgcgctgcgg 1740
gtgttcctcg cggggcgcgg ggagttcgcc gacatcgacg agtacgtctt cgcgggctgc 1800
gacgcggtcg cggtgctcac ctccaccctc gaccgcatgg gagtggcgta a 1851



<210> 4 
<211> 2202 
<212> DNA 

<213> Streptomyces cinnamonensis



<400> 4 

atgcggatcc ccgaattcga cgacatcgaa ctgggcgcgg gcggcggccc gtccggctcg 60
gcggagcagt ggcgcgccgc cgtgaaggag tccgtcggca agtccgagtc cgacctcctg 120
tgggagacgc ccgagggcat cgcggtcaag ccgctgtaca cgggcgccga cgtcgagggc 180
ctggacttcc tggagacgta cccgggtgtc gcgccgtatc tgcgcggccc ctacccgacg 240
atgtacgtga accagccgtg gacgatccgg cagtacgcgg gattctccac cgccgaggag 300
tccaacgcct tctaccgccg caacctcgcg gcaggccaga aggggctctc ggtcgccttc 360
gacctgccca cgcaccgcgg gtacgacagc gaccacccgc gcgtcaccgg tgacgtcggc 420
atggcgggcg tggccatcga ctccatctac gacatgcgtc agctcttcga cggcattccg 480
ctggacaaga tgacggtgtc gatgacgatg aacggtgccg tgctgcccgt tctcgcgctg 540
tacatcgtgg cggccgagga gcagggcgta ccgccggaga agctggcggg gaccattcag 600
aacgacatcc tcaaggagtt catggtccgc aacacctaca tctatccgcc gaagccctcg 660
atgcggatca tctccgacat cttcgcgtac acgtcgcaga agatgccgcg ctacaactcc 720
atctcgatct ccggctatca catccaggag gcgggcgcga cggccgacct ggagctggcg 780
tacacgctcg ccgacggtgt ggagtacctg cgagccgggc aggaggcggg cctggacgtg 840
gacgcgttcg cgccgcggct ctccttcttc tgggcgatcg gcatgaactt cttcatggag 900
gtcgccaagc tccgcgcggc gcgcctgctc tgggcgaagc tcgtgaagca gttcgacccg 960
aagaacgcca agtccctctc cctgcgcacc cattcgcaga catcgggctg gtcgctgacc 1020
gcgcaggacg tgttcaacaa cgtcacgcgc acgtgtgtcg aggcgatggc ggcgacgcag 1080
ggccacacgc agtccctgca cacgaacgcc ctggacgagg cgctcgccct gccgaccgac 1140
ttctccgcgc ggatcgcccg caacacccag ctgctcatcc agcaggagtc ggggacgacg 1200
cggacgatcg acccgtgggg cggcagcgcg tacgtcgaga agctgacgta cgacctggcg 1260
cgccgcgcct ggcagcacat cgaggaggtc gaggcggcgg gcggcatggc gcaggccatc 1320
gacgcgggca tcccgaagct gcgcgtcgag gaggccgcgg cgcgcaccca ggcgcgcatc 1380
gactcggggc gccagccggt catcggcgtc aacaagtacc gggtggacac cgacgagcag 1440
atcgacgtcc tgaaggtcga caactcctcg gtgcgcgccc agcagatcga gaagctgcgg 1500
cgcctgcgcg aggagcgtga cgacgccgcc tgccaggacg cgctgcgcgc cctgacggcg 1560
gcggccgagc gtggtcccgg ccagggactg gagggcaacc tgctcgcgct cgcggtcgac 1620
gcggcccggg cgaaggccac ggtcggtgag atctccgacg cactggagag cgtgtacggg 1680
cggcacgcgg gccagatccg tacgatctcc ggtgtgtacc gcaccgaagc aggccagtcg 1740
ccgagcgtgg agcgcacgcg tgccctggtg gacgcgttcg acgaggccga ggggcgcagg 1800
ccgcgcatcc tcgtcgcgaa gatgggtcag gacggccacg accgcggtca gaaggtgatc 1860
gcgagcgcct tcgccgacct gggcttcgac gtcgacgtcg gcccgctgtt ccagacgccg 1920
gcggaggtcg cgcgccaggc cgtcgaggcg gacgtgcaca tcgtcggcgt ctcctcgctt 1980
gccgcagggc acctcaccct cgtaccggca ctgcgcgagg agctggccgc ggagggccgc 2040
gacgacatca tgatcgtcgt gggcggcgtc atcccgccgc aggacgtcga ggccctgcac 2100
gaggcgggcg ccacggcggt gttcccgccc ggcacggtga tcccggacgc ggcgcacgac 2160
ctggtgaagc gtctggcggc cgacctcggc cacgaactgt ga 2202



<210> 5 
<211> 447
<212> DNA 

<213> Propionibacterium freudenreichii ssp. shermanii
<400> 5 

atgagtaatg aggatctttt catctgtatc gatcacgtgg catatgcgtg ccccgacgcc 60 
gacgaggctt ccaagtacta ccaggagacc ttcggctggc atgagctcca ccgcgaggag 120 
aacccggagc agggagtcgt cgagatcatg atggccccgg ctgcgaagct gaccgagcac 180 
atgacccagg ttcaggtcat ggccccgctc aacgacgagt cgaccgttgc caagtggctt 240 
gccaagcaca atggtcgcgc cggactgcac cacatggcat ggcgtgtcga tgacatcgac 300 
gccgtcagcg ccaccctgcg cgagcgcggc gtgcagctgc tgtatgacga gcccaagctc 360 
ggcaccggcg gcaaccgcat caacttcatg catcccaagt cgggcaaggg cgtgctcatc 420 
gagctcaccc agtacccgaa gaactga 447 

<210> 6 
<211> 148 
<212> PRT 

<213> Propionibacterium freudenreichii ssp. shermanii
<400> 6 

Met Ser Asn Glu Asp Leu Phe Ile Cys Ile Asp His Val Ala Tyr Ala
 1               5                  10                  15

Cys Pro Asp Ala Asp Glu Ala Ser Lys Tyr Tyr Gln Glu Thr Phe Gly
            20                  25                  30

Trp His Glu Leu His Arg Glu Glu Asn Pro Glu Gln Gly Val Val Glu
        35                  40                  45

Ile Met Met Ala Pro Ala Ala Lys Leu Thr Glu His Met Thr Gln Val
    50                  55                  60

Gln Val Met Ala Pro Leu Asn Asp Glu Ser Thr Val Ala Lys Trp Leu
65                  70                  75                  80

Ala Lys His Asn Gly Arg Ala Gly Leu His His Met Ala Trp Arg Val
                85                  90                  95

Asp Asp Ile Asp Ala Val Ser Ala Thr Leu Arg Glu Arg Gly Val Gln
            100                 105                 110

Leu Leu Tyr Asp Glu Pro Lys Leu Gly Thr Gly Gly Asn Arg Ile Asn
        115                 120                 125

Phe Met His Pro Lys Ser Gly Lys Gly Val Leu Ile Glu Leu Thr Gln
    130                 135                 140

Tyr Pro Lys Asn
145